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RECOMBINANT POLYPEPTIDES AND THEIR USES, INCLUDING ASSAY FOR AIDS VIRUS 



This invention relates to recombinant polypeptides and their uses, including assay for AIDS virus. A full 
background, and polypeptides which can be used for the same purpose, are given in EP-A-0255190, 
published 03.02.88, 

5 In particular, EP-A-0255190 discloses proteins indentified as R10, PB1, 590 and KH1. Their amino-acid 
sequences are given, as are the nucleotide sequences encoding these fusion proteins. 

Novel proteins according to the present invention are the HIV portions of proteins R10, PB1, 590 and 
KH1, the amino-acid sequences of which are respectively given in claims 1, 2, 3 and 4, and also those 
sequences without the N-terminal methionine. A further aspect of the invention comprises DNA encoding 
io the novel proteins, (examples of such nucleotide sequences are shown in the accompanying Figures 1 , 2, 3 
and 4 respectively) and transfer vectors comprising such DNA, e.g. plasmids pAPB1 and pd2PB1 . 

The novel proteins can be used in assays for detecting or quantifying antibody against HIV, e.g. when 
attached to an immunoabsorbent solid phase such as is provided by a glass or plastic bead, a well of a 
microtitre plate or a test tube. The proteins may also be incorporated into vaccine compositions, as is also 
15 generally described in EP-A-0255190. 

The following illustrates the present invention. 



Non-fusion Derivative of PB1 
20 ■ " " — — — 

A non-fusion derivative of the PB1 protein, containing no non-HlV amino-acids other than an N-terminal 
methionine, was constructed using oligonucleotide-directed site-specific mutagenesis (Inouye, S. and 
Inouye, M„ "Synthesis & Applications of DNA & RNA\ ed. Narang, Saran A. Academic Press, 1987). In this 
25 procedure, 90 non-HTLV-lll bp upstream and 39 downstream of the env gene sequence in pPB1 were 
deleted via DNA loopouts generated by hybridization with synthetic oligonucleotides. 

The oligonucleotide synthesized for the N-terminal loopout was designed so that the start codon of the 
^-glucuronidase gene is placed immediately adjacent to the 5' end of the HTLV-lll env gene sequence 
(Figure 5). The oligonucleotide includes sequences homologous to both sides of this newly created junction 
30 that allow proper hybridization to the plasmid DNA. . 

The two DNA molecules used to form a heteroduplex with a single-stranded gap that is the substrate for 
hybridization were created by digesting pPB1 with Sail and Hpal, or with Pstl linearized pPB1, and a double 
digest with Sail and Hpa l yields fragments of 3800 and 700 bp, the larger of which was gel-isolated for use 
in the mutagenesis. 

35 Kinasing of the oligonucleotide, hybridization, polymerization and ligation to yield closed circular 
molecules were done according to the methods of Inuoye and Inouye mentioned above. To enrich for DNA 
molecules containing the deletion, the DNA mixture was digested with Mlul, which cuts within the region 
being deleted. ~™ 

The digested DNA was used to transform competent E. coli JM105 cells and plasmid-containing 

40 transformants were isolated by overnight growth on YT (8 g tryptone, 5 g yeast extract, and 5 g NaCI per 
liter) Cm plates at 37 *C. 

Plasmid DNA was isolated from each transformant and screened for the correct construction by 
simultaneous digestion with Mlul and Hindlll. Molecules that were not deleted yielded fragments of 
approximately 3900 and 600 bp. Those containing the deletion did not have the Mlul site and yielded linear 

45 molecules of approximately 4400 bp. Plasmid DNA from transformants that appeared to contain the deletion 
was retransformed to ensure segregation of deleted and non-deleted plasmids and the recovery of pure 
plasmid populations. DNA from these second transformants was analyzed as in the previous digest and was 
determined to have the correct construction. This plasmid was designated pAPB1. 

To eliminate the C-terminal non-HTLV-lll amino acids, oligonucleotide-directed site-specific mutagenesis 

so was carried out as above, using the pAPB1 plasmid as a substrate. The oligonucleotide (Figure 6) was 
designed to position the TGA codon that occurs out-of-frame downstream from the env gene sequence so 
that it is immediately adjacent to the 3' end of the env gene sequence and in-frame to"act as a translational 
stop codon. 

The molecules to form the heteroduplex used for hybridization were created by digesting pAPB1 and 
Pst l alone or with Kpn l and Hpa l. The large Kpn l/ Hpa l fragment encompassing most of the vector was gel- 
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isolated for use in the mutagenesis. Kinasing, hybridization, polymerization and ligation were performed as 
above. Enrichment for deleted molecules was accomplished. by digesting with Hindlll, which cuts within the 
region being deleted. The DNA was used to transform cells as above. 

Plasmid DNA was isolated from each transformant and screened for the correct construction by 

5 simultaneous digestion with EcoRI and Hpal. The deleted plasmid yields two restriction fragments of 2900 
and 1750 bp. Plasmid DNA showing "this pattern was retransformed as above, and DNA from these 
transformants was analyzed with the same digest. This plasmid, containing N-terminal and C-terminal 
deletions, is designated pd2PB1. 

When the strain harboring plasmid pAPB1 is grown in 2% medium (2% yeast extract, bactotryptone, 

10 casamino acids [Difco, Detroit, Ml], 0.2% potassium monobasic, 0.2% potassium dibasic, and 0.2% sodium 
dibasic) containing 50 jxg/ml ampicillin and the total complement of cellular proteins electrophoresed on an 
SDS-polyacrylamide gel, a protein of approximately 22 kD can be visualized by either coomassie blue 
staining or by western biot analysis using as probe selected sera from animals immunized with recombinant 
env gene proteins. Under the same conditions, a protein of approximately 20 kD is produced in a strain 

75 containing pd2PB1. 

The technique of oligonucleotide-directed site-specific mutagenesis can be used in a similar way to 
eliminate the non-HTLV-lll amino acids flanking the env gene fusion proteins R10, 590, and KH1 . 

In the procedure detailed above, the removal of the non-HTLV-lll sequence from the fusion proteins 
involves removal of amino acids at both the N-terminus and the C-terminus of the protein and is 
20 accomplished in two sequential steps. 

It is well known in the art that a methionine at the N-terminal position can be enzymatically cleaved by 
the use of the enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coH (Ben-Bassat, 
A., Bauer, K„ Chang, S.-Y., Myambo, K., Boosman, A. and Chang, S. [1987] Journal of Bacteriology 169(2): 
751-757) and Salmonella typhimurium , and in vitro activity has been demonstrated on recombinant proteins 
25 (Miller, C.G.. Strauch, K.L, Kukral, A.M., Miller, J.L, Wingfield, P.T., Massei, G.J., Werlen, R.C., Graber, P. 
and Mowa, N.R. [1987] Proc. Natl. Acad. Sci. USA 84:2718-2722). Therefore, removal of an N-terminal 
methionine may be achieved either in vivo by expressing the protein in a host which produces MAP (e.g., 
E.coli CM89 or S. cerevisiae ), or in vitro by use of purified MAP (e.g., procedure of Miller et al.). 

30 

pd2PB1 Purification 



Unless specified otherwise, all steps are carried out at room temperature. 

as Lysis -Three 700 ml bottles of frozen cell paste containing pd2PB1 are thawed at 37° C, and are then spun 
at 4,000 rpm in a J-6B centrifuge with a JS-4.2 rotor (Beckman, Paio Alto, CA) at 4*C for 30 min. The 
supernatant is then discarded and the weight of the cell pellet is determined. The cell pellet (typically 1 kg) 
is resuspended in 2 volumes of lysis buffer (v/w) which consists of 8 M urea, 20 mM Tris-HCI (pH 7.5 ± 
0.1), 1 mM EDTA, 14.7 mM 2-mercaptoethanol and 1 mM PMSF. 

40 The resuspended cell pellet is run through a Type TDK Pilot DYNO-MILL® (Impandex Inc., Maywood, 
NJ) containing 0.5-0.7 mm glass beads at 200-400 ml/min. Prior to use the DYNO-MILL® is charged with 
one liter of lysis buffer and cooled so that the solution flowing through is at less than ambient temperature. 
The resuspended cell pellet is passed through the DYNO-MILL® twice, and after the second pass, the 
DYNO-MILL® is washed with 1 liter of lysis buffer. Lysed cell suspension and wash are pooled. 

<s 

Concentration and filtration 

The lysed cell suspension plus one liter wash is concentrated to 800 ml using a 0.45 urn. DURAPORE 
so Pellicon cassette in a Pellicon 4 GPM system (Millipore. Bedford, MA). The concentration is done with an 
inlet pressure of less than or equal to 275 kPa and an outlet pressure between 70 and ]40 kPa. After 
concentration the lysed cell suspension is filtered with 4 liters of lysis buffer using the same Pellicon 
system, cassette and pressure settings with the tubing rigged for dyafiltration. 

55 

Extraction 

The washed lysis cell suspension is extracted with 10 I of extraction buffer consisting of 6 M guanidine 
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HCI/100 mM Tris-HCI (pH 7.6 t 0.1), and 10 mM EDTA, using the same Pellicon system., cassette and. 
pressure settings as described above with the tubing rigged for dyafiltration. 



5 Buffer exchange 

The filtrate from the previous step is typically concentrated to 1 liter using a Pellicon 4GPM system with 
two PTGC cassettes (10,000 NMWL). The concentration Is done with an inlet pressure of less than or equal 
to 345 kPa and an outlet pressure between 205 and 310 kPa. After concentration, the supernatant is buffer 

w exchanged with CM column buffer consisting of 8 M urea, 25 mM potassium phosphate, and 1 mM EDTA 
(pH 6.8 t 0.1), with conductivity less than or equal to 3.0 ms/cm. For buffer exchange, the same Pellicon 
system, the same cassettes and the same pressure settings as above are used with the tubing rigged for 
dyafiltration: Eight liters of CM column buffer are used to buffer exchange 1 liter of concentrated extract. 
After buffer exchange, the buffer-exchanged extract is drained from the system and the system is washed 

75 with 1 liter of CM column buffer, the buffer-exchanged extract and the wash are pooled and the solution's 
conductivity and pH are measured. The conductivity of the solution is adjusted to less than or equal to 3.0 
ms/cm with deionized 8 M urea and the pH is adjusted to be within the range of 6.5-7.0. 



20 CM chromatography 

A 50 x 51 cm column of CM SEPHAROSE® FAST FLOW (Pharmacia, Piscataway, NJ) is equilibrated 
by washing the column sequentially with 4 column volumes of 0.5 M NaOH, 2 column volumes of deionized 
water and 2-3 column volumes of CM column buffer. The column is considered equilibrated when the pH of 

25 the outflow is within 0.2 units of the CM column buffer and the conductivity of the outflow is within 0.3 
ms/cm of the CM column buffer. 

For loading, the buffer exchanged extract is pumped on to the column at an inlet pressure between 10 
and 15 psL After loading, the CM column is washed with CM column buffer until the OO at 280 nm of the 
outflow is less than 0.1 . The pd2PB1 is then eluted with an 8-liter linear gradient of 0-0.5 M NaCI in CM 

30 column buffer and collected in 100 ml fractions. The fractions are assayed by SDS-PAGE and Western with 
anti-gp160 antibody, and those containing significant pd2PB1 and trace contaminants are pooled. 

Organic extraction 

35 

The pooled protein solution from the previous step is brought to a ratio of 55% acetonitrile to 45% 
protein solution (v/v) by the slow addition of pure acetonitrile with mixing. After addition of all of the 
acetonitrile, the solution is centrifuged in a J2-21 centrifuge using a JA10 rotor (Beckman) at 10,000 rpm 
and 4*C for 15 min. After centrifugation, the supernatant is collected and the pellet is discarded. 

40 The centrifugation supernatant is brought to a ratio of 35% ethanol to 65% supernatant (v/v) by slow 
addition of 95% ethanol with mixing. After addition of all of the ethanol, the solution is centrifuged in a J2-21 
centrifuge using a JA-10 rotor at 10,000 rpm and 4*C for 15 min. After centrifugation the pellet is collected 
and the supernatant is discarded. 

The pellet is allowed to air dry for 15 min, and is then redissolved in S-300 column buffer, which 

45 consists of 8 M urea, 0.3 M glycine, 5 mM EDTA, 15 mM 2-mercaptoethanol, 1 mM dithiothreitol (DTT) (pH 
8.50 i 0.01). The pellet is dissolved in a volume of S-300 column buffer equal to one-tenth the volume of 
the pooled protein solution at the beginning of this step. 



so Concentration 

The absorbance of the redissolved protein solution from above is determined at 280 nm and an 
approximate protein concentration is determined by assuming that a 1 mgml solution of protein has an 
absorbance of 1.0 at 280 nm. The solution is concentrated to 10 mg/ml using a 200 ml Amicon stirred cell 
55 concentrator with a YM-10 membrane. 



S-300 chromatography 
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Thirty to seventy ml of the concentrated protein soiution is loaded on a 5.0 x 135 cm column of 
. SEPHACRYL® S-300 from Pharmacia. The column had been previously equilibrates with S-300 column 
buffer which consists of 8 M urea, 0.3 M glycine, 5 mM EDTA, 15 mM 2-mercaptoethanol, 1 mM DTT (pH 
8.50 t 0.01). After loading, the column is run isocratically in the same buffer. Twenty ml fractions are 

s collected and the fractions are assayed for pd2PBl content by SDS-PAGE. 

Equal volume aliquots are taken from suitable fractions containing pd2PB1 and are used to determine 
which fractions are satisfactory for pooling. The aliquots are pooled, dialyzed overnight versus 8 M urea, 25 
mM sodium phosphate, 1 mM EDTA (pH 6.8 t 0.1), and the OD at 280 nm of the dialyzed pool is 
determined using the dialysis buffer as blank. The protein concentration of the solution is determined using 

10 the calculated extinction coefficient of pd2PB1 of 1.0 (mg/ml)" 1 SDS-PAGE is run on 10 ag of the dialyzed 
pooling using a 15% SDS acrylamide gel. After coomassie staining and destaining, the gel is scanned 
using an LKB (Gaithersburg, MD) scanning densitometer attached to a Waters (Milford, MA) 740 Integrator, 
if the pd2PB1 band on the gel is more than 97% pure, then the fractions that were used for the aliquot are 
checked for endotoxins at a 1 to 20 dilution in the Limulus Amebocyte Lysate (LAL) assay using 0.06 eu/ml 

15 tubes. If the LAL test on the diluted fractions is negative, the fractions are pooled and used for subsequent 
operations. If the gel fails to meet the purity specification, the process is repeated using equai volume 
aliquots from a different set of fractions. Only those fractions having a negative LAL test at a 1 to 20 dilution 
are pooled. 

20 
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Claims 

1 . A protein having the amino-acid sequence 

MetVslTrpLysBluAlaThrThrThrLeuPheCasAlaSerAspAlaLysAlsTyr 
AspTh rG 1 uValHisAsnVslTrpAlsThrHisAl sCasVs LP roThrAspPro 
AsnProGlriGluV3lV3lLeuV3lAsnV3lThrGluA-snPheAsriMetTrpLys 
AsnAspMetV3lGluGlriMetHisGluA5PlleI leSerLeuTrpAspGlnSer 
LeuLysProCysVBlLysLeuThrProLeuCysVslSerLeuLysCysThrAsp 
LeuLysAsriAspThrAsnThrAsnSerSerSerGlyAraMetl leHetGluLys 
GlyGluIleLysAsnCysSerPheAsnlleSerThrSerl leA rsJGlyLysVsl 
GlnLysGluTyrAlsPhePheTyrLysLeuAspIlelleProIleAspAsnAsp 
ThrThrSe rTy rThrLeuThrSerCysAsnThrSerVal IleThrGlnAlsCys 
ProLy5V3lSerPheGluProIleProIleHisTyrCysAlaProAl3GlyPhe 
AlalleLeuLysCysAsnAsnLysThrPheAsnGlyThrGlyProCysThrAsn 
VslSerThrVslGlnCysThrHisGlylleArdProValVslSerThrGlriLeu 
LeuLeuAsnGlySerLeuAlsGluGluGluVBlVslIleAraSerAlsAsnPhe 
ThrAspAsriAlaLysThrllelleVslGlriLeuAsnGlriSerValGluI 1 eAsn 
CysTh rArsSProAsnAsnAsnThrArSLysSerlleAral leGlnArsGlyPro 
GlyAr^AlsPheValThrlleGlyLysUeGlyAsnMetAr^GlnAlaHisCys 
AsnlleSerAraAlaLysTrpAsnAsnThrLeuLysGlrilleAspSerLysLeu 
AraGluGlnPheGlyAsnAsnLysThrllel lePheLysGlnSerSerGlyGly 
AspProGluIleVslThrHisSerPheAsnCysGlyGlyGluPhePheTyrCys 
AsnSerThrGlnLeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrLys 
GlySerAsnAsnThrGluGlySerAspThrlleThrLeuProCysArdl leLys 
GlnllelleAsnMetTrpGlnGluValGlyLysAlsMetTyrAlsProProIle 
SerGlyGlnlleAr^CysSerSerAsriTleThrGlyLeuLeuLeuThrAraAsp 
GlyGlyAsnSe rAsnAsnGl uSer 

2. A protein having the amino-acid sequence 
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Met LeuAsriGlnSerValGluIleAsnCysThrArdProAsriAsnAsriThrArdLys 
SerlleArglleGlnAraGlyproGlyAr^AlaPheValThrlleGlyLysIle 
GlyAsnMetAr^GlnAlaHisCysAsnl leSerArdAlsLysTrpAsnAsnThr 
LeuLysGlrilleAspSerLysLeuAr^GluGlnPheGlyAsriAsriLysThrl le 
IlePhoLysGlnSerSerGlyGlyAspproGluIleVslThrHisSerPheAsn 
CysGlyG lyGluPhePhely rCysAsnSerTh rGlnLeuPheAsnSerThrTrp 
PheAsnSerThrTrpSerThrLysGlySe rAsnAsnThrGluGlySerAspThr 
IleThrLeuProCysAral leLysGlnl lelleAsnMetTrpGlnGluValGly 
LysAlaHeiTyrAlaProProIleSerGlyGlnlleAr^CysSerSerAsrille 
ThrGlyLeuLeuLeuThrAraAspGlyGlyAsnSerAsnAsnGluSer 

3. A protein having the amino-acid sequence 
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MetLeuAsnGlnSerVslGluIleAsnCysThrAraProAsnAsnAsnThrAraLys 
SerlleAr^IleGlnAr^GlyProGlyAr^AlaPheValThrlleGlyLysIle 
GlyAsnMetAr^GlnAlaHisCysAsnlleSerApdAlsLysTrpAsnAsnThr 
LeuLysGlnlleAspSerLysLeuAr^GluGlnPheGlyAsnAsnLysThrlle 
IlePheLysGlnSerSerGlyGlyAspProGluIleVBlThrHisSerPheAsri 
CysGlyGlyGluPhePheTy rCy s AsnSe r ThrGlnLeuPhe AsnSe rThrT rp 
PheAsnSerThrTrpSerThrLysGlySerAsriAsnThrGluGlySerAspThr 
IleThrLeuProCysAr^I leLysGlnllelleAsnMetTrpGlnGluVslGly 
LysAlaMetTy rAlsProProIleSerGlyGlnlleAr^CysSerSerAsnl le 
ThrGlyLeuLeuLeuThrAr^AspGlyGlyAsriSerAsnAsnGluSerG luI le 
PheArsProGlyGlyGlyAspMetAraAspAsnTrpArdSerGluLeuTyrLys 
Ty rLysVslValLysIleGluProLeuGlyValAlaProThrLysAlsLysAr^ 
Ar^VelValGlnAr^GluLysArdAlaValGlylleGlyAlaLeuPheLeuGly 
PheLeuGlyAlsAlaGlySerThrMetGlyAlaAlaSerMetThrLeuThryal 
GlnAlaArsGlnLeuLeuSerGly IleValGlnGlnGlnAsnAsriLeuLeuArd 
AlalleGluAlsGlnGlhHisLeuLeuGlnLeuThrValTrpGlylleLysGln 
LeuGlnAlsA-rdX leLeuAlaValGluArSTyrLeuLysAspGlnGlnLeuLeu 
Glyl leTrpGlvCysSerGlyLysLeuI leCysThrThrAlaValProTrpAsn 
AlaSerTrpSerAsnLysSerLeuGluGlnlleTrpAsnAsnMetThrTrpMet 

GluTrpAspAraGluIleAsnAsnTyrThr 
4. A protein having the amino-acid sequence 
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Met ValTppLysGluAlaThrThpThpLeuPheCysAlaSepAspAlaLysAlaTyp 
AspThrGluValHisAsnValTrpAlaThrHisAlaCysVslProThrAspPro 
AsnPpoGlnGluValValLeuValAsnyalThrGluAsnPheAsnMetTrpLys 
AsriAspMetValGluGlnMetHisGluAspIlelleSerLeuTrpAspGlnSer 
LeuLysProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAsp 
LeuLy-sAsnAspTh rAsnThrAsnSe rSerSe rGlyA r^Metl 1 eMetGluLys 
GlyGluI leLysAsnCysSe-rPheAsnl leSe rThrSe r 1 1 eA pSG 1 yLysVa 1 
GlnLysGluTy rAlaPhePheTy rLysLeuAspI lelleProIleAspAsnAsp 
ThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCys 
ProLysVslSerPheGluProIleProIleHisTyrCysAleProAlaGlyPhe 
Ala I leLeuLysCysAsnAsnLysThrPheAsnGlyThrGlyProCysThrAsn 
ValSerThrValGlnCysThrHisGlylleArSProValValSerThpGlnLeu 
LeuLeuAsnGlySerLeuAlaGluGluGluValVal I 1 eA r^Se rAl a AsnPhe 
ThrAspAsnAlaLysThrllelleValGlriLeuAsnGlnSerValGluIleAsn 
CysThrAr^ProAsnAsr.AsnThrAr^LysSerl leApsf 1 1 eGlnApSGlyP po 
GlyArSAlaPheValThrlleGlyLysIleGlyAsnMetAr^GlriAlaHisCys 
Asnl leSerAr^AlaLysTrpAsnAsnThrLeuLysGlnl leAspSerLysLeu 
ArdGluGlnPheGlyAsriAsnLysThrllellePheLysGlnSerSerGlyGiy 
AspProGluIleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCys 
AsnSerThrGlr.LeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrLys 
GlySerAsnAsnThrGluGlySerAspThpIleThrLeuProCysAr^I leLys 
GlnllelleAsnMetTppGlnGluVslGlyLysAlaHetTyrAlaPraPpoIle 
SepGlyGlnlleArsiCysSepSepAsrilleThrGlyLeuLeuLeuThpAp^Asp 
GlyGlyAsnSepAsnAsriGluSerGluIlePheApSPPoGlyGlyGlyAspMet 
A PS A sp A snT pp A p2 Se pG luLeuTy pLysTy pLysVal ValLys I leGluPpo 
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LeuGlyVslAlaProThrLysAlsLysArdArdValValGlriAr^GluLasAr^ 
AlaVslGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThr 

5 

MetGlyAlaAlsSerMetThrLeuThrValGlriAlsAr^GlnLeuLeuSerGly 
IleValGlriGlriGlriAsnAsnLeuLeuAraAlalleGluAlaGlnGlriHisLeij . 
10 LeuGlnLeuThrValTrpGlylleLysGlnLeuGlriAlaArdlleLeuAlaVal 
GluA rsJTy rLeuLysAspGlnGlnLeuLeuGlylleT rpGlyCysSerGlyLys 
LeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeu 

is 

GluGlnl leTrpAsnAsnMetThrTrpMetGluTrpAspArdGluIleAsnAsn 
Ty rThr 

2Q 5. A protein according to any preceding claim, which does not have the N-terminal methionine. 

6. DNA encoding a protein according to any preceding claim. 

7. A recombinant DNA transfer vector comprising DNA according to claim 6. 

8. A vector according to claim 7, which is plasmid pAPBl or piasmid pd2PB1. 

9. An immunochemical assay for detecting or quantifying antibody against HIV in a liquid, which 
25 comprises employing a protein according to any of claims 1 to 5. 

10. An immunoadsorbent, suitable for use in a solid phase immunochemical assay for antibody against 
HIV, comprising a solid phase to which is attached a protein according to any of claims 1 to 5. 

11. An immunoadsorbent according to claim 10, further comprising a post-coat of animal protein, 

12. A method of detecting antibody against HIV in a liquid, which comprises the steps of: 

3Q (a) incubating the liquid with an immunoadsorbent according to claim 10 or claim 11, under 

conditions which allow the antibody (if present) in the liquid to bind to the immunoadsorbent; 

(b) separating the immunoadsorbent from the liquid; 

and 

(c) determining the presence of antibody bound to the immunoadsorbent. 

35 

13. A method according to claim 12, wherein step (c) comprises incubating the immunoadsorbent with a 
labelled antibody against antigen of the species from which the liquid is derived; separating the immunoad- 
sorbent from the labelled antibody; and detecting the label associated with the immunoadsorbent. 

14. A method according to claim 13, wherein the labelled antibody is a labelled protein according to any 
so of claims 1 to 5. or labelled protein A. 

15. A method according to claim 13, wherein the liquid is a human serum or plasma sample; the 
immunoadsorbent comprises a bead coated with the protein; and the labelled antibody is labelled anti- 
(human IgG) antibody. 

- 16. A method according to claim 15, wherein the anti-(human IgG) antibody is an animal (e.g. goat) 
45 antibody and the serum or plasma sample is diluted with normal serum from an animal of the same 
species. 

17. A kit for use in detecting antibody against HIV, as in any of claims 12 to 16, the kit comprising: 

(a) an immunoadsorbent according to claim 10 or clam 11; 

(b) labelled HIV antibody; and 

50 (c) means for detecting the label. 

18. A protein according to any of claims 1 to 5, for stimulating a lymphocyte proliferative response in 
humans. 

19. A vaccine composition comprising one or more proteins having the antigenic properties of a protein 
55 according to any of claims 1 to 5. 
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Fig. 1 

ATG GTGTGGAAGGAAGCAACCACCACT CTATTTTGT6CATCAGATGCTAAAGCATAT 
GATACAGAGGTACATAATGTTTG66CCACACATGCCTGTGTACCCACAGACCCC 
AACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAA 
AATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGC 
CTAAAGCCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGAT 
TTGAAGAATGATACTAATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAA 
GGAGAGATAAAAAACTGCTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTG 
CAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAATGAT 
ACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGT 
CCA AAGGT ATCCTTTGAGCCAATTCCCAT AC ATTATTGTGCCCCG6CTGGTTTT 
GCGATTCT AAAATGTAATAATAAGACGTTCAATGGAACAGGACCATGTACAAAT 
GTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTG 
CTGTTAAATGGCAGTCTGGCAGAAGAAGAGGTAGTAATTAGATCTGCCAATTTC 
ACAGACAATGCTAAA ACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAAT 
TGTACAAG ACCC AACAACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCA 
GGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGT 
AACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGATAGCAAATTA 
AGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGG 
GACCCAGAAATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGT 
AATTCAACACAACTGTTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAA 
GGGTCAAATAACACTGAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAA 
CAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGTGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGAT 
GGTGGTAATAGCAACAATGAGTCC 
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Fig. 2 

ATG CTGAACCAATCT6TAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAA 
AGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATA 
GGAAATATGAGACAAGCACArTGTAACATTAGTAGAGCAAAATGGAATAACACT 
TTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGAAATAATAAAACAATA 
ATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTTAAT 
TGT6GAGGGGAATTTTTCTACTGTAATTCA ACACAACTGTTTAATA6TACTTGG 
TTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGACACA 
ATCACCCTCCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGAAGTAGGA 
AAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTTCATCAAATATT 
ACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAACAATGAGTCC 
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Fig. 3 

ATG CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAA 
AGTATCCGTATCCAGAGA66ACCAGGGAGAGCATTTGTTACAATAGGAAAAATA 
GGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAACACT 
TTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGAAATAATAAAACAATA 
ATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTTAAT 
TGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACTTGG 
TTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGACACA 
ATGACCCTCCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGAAGTAGGA 
AAAGCAATGTATGCCCCTCCCATCAGTGGACAA.ATTAGATGTTCATCAAATATT 
ACAGG6CTGCTATTAACAAGAGATGGT6GTAATAGCAACAATGAGTCCGAGATC 
TTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGA6TAGCACCCACCAAGGCAAAGA6A 
AGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGG 
TTCTTGGGAGCAGCAGG AAGCACTAT6GGCGCAGCGTCAATGACGCTGACGGTA 
CAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAATTTGCTGAGG 
GCTATTGAGGCGCAACAGCATCTGTT6CAACTCACAGTCTGGGGCATCAAGCA6 
CTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCT6 
GG6ATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 
GCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGGATG 
GAGTGGGACAGAGAAATTAACAATTACACA 
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Fig. 4 



ATG GTGTGGAAGG AAGCAACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATAT 
GATACAG AGGTACATAATGTTTGGGCCACACAT6CCTGTGTACCCACAGACCCC 
AACCCACAAGAAGTAGTATTGGT AAATGTGACAGAAAATTTTAACATGTGGAAA 
AATGACATGGTAGAACAGATGCAT6AGGATATAATCAGTTTATGGGATCAAAGC 
CTAAAGCCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGAT 
TTGAAGAAT6ATACTAATACCAATAGTAGTA6CGGGAGAATGATAATGGA6AAA 
GGAGAGATAAAAAACTGCTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTG 
CAGAAAGAATAT6CATTTTTTTATAAACTTGATATAATACCAATAGATAATGAT 
ACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGT 
CCAAAG6TATCCTTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTT 
GCGATTCTAAAATGTAATAATAAGACGTTCAATGGAACAGGACCATGTACAAAT 
GTCAGCACAGTACAATGTACACAT6GAATTAGGCCAGTAGTATCAACTCAACTG 
CTGTTAAATGGCAGTCTGGCAGAAGAAGAGGTAGTAATTAGATCTGCCAATTTC 
ACAGACAATGCTAAAACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAAT 
TGTACAAGACCCAACAACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCA 
GGGAGAGC ATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGT 
AACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGATAGCAAATTA 
AGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGG 
GACCCAGAAATTGTAAC6CACAGTTTTAATTGTGGAGGGG AATTTTTCTACTGT 
AATTCAACACAACTGTTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAA 
GGGTCAAATAACACTGAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAA 
CAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
' AGTGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGAT 
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Fig. 4 (cont. } 

GGTGGTAATAGCAACAATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATG 
A6G6ACAATT6GAGAAGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCA 
TTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGA 
GCAGTG66AAT AGGAGCTT T6TTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACT 
ATGGGCGCAGCGTCAATGACGCTGACGGTACAG6CCAGACAATTATTGTCTGGT 
ATAGT6CAGCAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTG 
TTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTG 
GAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTG6AAAA 
CTCATTTGCACCACTGCTGTGCCTTG6AATGCTAGTTGGAGTAATAAATCTCTG 

GAACAGATTTGGAATAACATGACCTGGATGGAGTGGGACAGAGA AATTAACAAT 
TACACA 
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